PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bostr.30275s0233.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Boechereae; Boechera
Family HD-ZIP
Protein Properties Length: 717aa    MW: 80100.5 Da    PI: 6.0586
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bostr.30275s0233.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox55.11.3e-1790144256
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           ++ +++t+ q++++e+lFe+n++p+ ++r +L k+lgLt  qVk+WFqN+R++ k
  Bostr.30275s0233.1.p  90 KRSHRHTARQIQQMEALFEENPHPDDSKRLRLGKELGLTPLQVKFWFQNKRTQIK 144
                           455789*********************************************9877 PP

2START138.85.6e-442404701206
                           HHHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S. CS
                 START   1 elaeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv...........dsgealrasgvvdmvlallveellddkeqWdetla. 77 
                           ela ++aqelvk+ + +ep+W+k +  +n+ ++l ++e +k            ++ ea++a +vv m++ +lv+ +ld   +W+e +  
  Bostr.30275s0233.1.p 240 ELAVSCAQELVKMCDINEPLWTKKR-LDNENVCLNEEEYKKMflwppmddddrFRREASKANAVVMMNSITLVKAFLDAD-KWSELFCs 326
                           57899********************.6777777777777777999999999999**************************.****9999 PP

                           ...EEEEEEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-......TTS--....-TTSEE-EE CS
                 START  78 ...kaetlevissg.....galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvds......eqkppe...sssvvRael 147
                              +a+t++ issg     g+l lm+a lq+ splvp R+ +f+Ry +q  ++g+w+ivd  +ds      ++   +   +  + R  +
  Bostr.30275s0233.1.p 327 ivlSAKTIQIISSGvsgasGTLLLMYAGLQVVSPLVPtREAYFLRYVEQnAEEGKWTIVDFPIDSfhgfikPA---SaatTTDLYR--R 410
                           999*********************************************************9998722222222...2344677777..8 PP

                           SSEEEEEEEECTCEEEEEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 148 lpSgiliepksnghskvtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                            pSg++i++++ng+s+vtwvehv+++++++ ++++r  vksg+a+ga +w+a l+rqce+
  Bostr.30275s0233.1.p 411 KPSGCIIQEMPNGYSQVTWVEHVEVEEKHVqDEAVREYVKSGVAFGAERWLALLKRQCER 470
                           ******************************99**************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466899.41E-1875147IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.3E-1984150IPR009057Homeodomain-like
PROSITE profilePS5007116.55286146IPR001356Homeobox domain
SMARTSM003891.1E-1687150IPR001356Homeobox domain
CDDcd000861.83E-1689147No hitNo description
PfamPF000463.3E-1590144IPR001356Homeobox domain
PROSITE profilePS5084841.707231473IPR002913START domain
SuperFamilySSF559618.52E-30232472No hitNo description
CDDcd088753.15E-104235469No hitNo description
SMARTSM002342.3E-28240470IPR002913START domain
PfamPF018528.5E-37241470IPR002913START domain
SuperFamilySSF559613.16E-6514705No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 717 aa     Download sequence    Send to blast
MYGDYQVLKS IEEEGHAVLN SDNIFGSTSS SPTATIQNPN FKFTSIDNPN FPYIIPKEEY  60
RMLSMIESGS GHDPVENTAI EQEPPPAKKK RSHRHTARQI QQMEALFEEN PHPDDSKRLR  120
LGKELGLTPL QVKFWFQNKR TQIKAQQDRR DNVLLKAEND TLKIESQNLQ SSLQCLSCSF  180
CGYNLRLENT RLRQELDRLR RIASMRNPPP SQEIACFFPE TNNNNNNMLI AEEEKAIAME  240
LAVSCAQELV KMCDINEPLW TKKRLDNENV CLNEEEYKKM FLWPPMDDDD RFRREASKAN  300
AVVMMNSITL VKAFLDADKW SELFCSIVLS AKTIQIISSG VSGASGTLLL MYAGLQVVSP  360
LVPTREAYFL RYVEQNAEEG KWTIVDFPID SFHGFIKPAS AATTTDLYRR KPSGCIIQEM  420
PNGYSQVTWV EHVEVEEKHV QDEAVREYVK SGVAFGAERW LALLKRQCER MASLMATCIT  480
DLGVIPSVEA RKNLMKLSQI MVRTFCLNIS NSYGQASTKN TVRIVTRKVC GGLVPCAVSV  540
TYLPYSHHKV FVLLRDNKFL SQLEILFNGS SFQEVAHIAN GSHPGNCISL LRINEESSSS  600
HNVELMLQET CTDDSGSLLV YSTVDPDVVQ LAMNGEDPCK IPLLPVGFSV VPVNPSDGVE  660
GISVNLPSCL LTVSIQVLGS NVATARLDLS TVSAINNRIC ATVNRITSAL VNHVGN*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
18892KKRSH
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY1337000.0AY133700.1 Arabidopsis thaliana clone C103238 putative GLABRA2 protein (At4g17710) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqNP_193506.20.0homeobox-leucine zipper protein HDG4
SwissprotQ8L7H40.0HDG4_ARATH; Homeobox-leucine zipper protein HDG4
TrEMBLR0GP440.0R0GP44_9BRAS; Uncharacterized protein
STRINGAT4G17710.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM43562548
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G17710.10.0homeodomain GLABROUS 4